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I.  INTRODUCTION 


Importance  Sampling  for 
Characterizing  STAP  Detectors 

RAJAN  SRINIVASAN 
University  of  Twente 
The  Netherlands 

MURALIDHAR  RANGASWAMY 

Air  Force  Research  Laboratory  Sensors  Directorate 

This  paper  describes  the  development  of  adaptive  importance 
sampling  (IS)  techniques  for  estimating  false  alarm  probabilities 
of  detectors  that  use  space-time  adaptive  processing  (STAP) 
algorithms.  Fast  simulation  using  IS  methods  has  been  notably 
successful  in  the  study  of  conventional  constant  false  alarm 
rate  (CFAR)  radar  detectors,  and  in  several  other  applications. 
The  principal  objectives  here  are  to  examine  the  viability  of 
using  these  methods  for  STAP  detectors,  develop  them  into 
powerful  analysis  and  design  algorithms  and,  in  the  long  term, 
use  them  for  synthesizing  novel  detection  structures.  The  adaptive 
matched  filter  (AMF)  detector  has  been  analyzed  successfully 
using  fast  simulation.  Of  two  biasing  methods  considered,  one 
is  implemented  and  shown  to  yield  good  results.  The  important 
problem  of  detector  threshold  determination  is  also  addressed, 
with  matching  outcome.  As  an  illustration  of  the  power  of  these 
methods,  two  variants  of  the  square-law  AMF  detector  that  are 
thought  to  be  robust  under  heterogeneous  clutter  conditions  have 
also  been  successfully  investigated.  These  are  the  envelope-law 
and  geometric-mean  STAP  detectors.  Their  CFAR  property 
is  established  and  performance  evaluated.  It  turns  out  the 
variants  have  detection  performances  better  than  those  of  the 
AMF  detector  for  training  data  contaminated  by  interferers.  In 
summary,  the  work  reported  here  paves  the  way  for  development 
of  advanced  estimation  techniques  that  can  facilitate  design  of 
powerful  and  robust  detection  algorithms. 
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Estimation  of  false  alarm  probabilities  (FAPs) 
of  detection  algorithms  that  employ  space-time 
processing  is  examined  here  using  forced  Monte 
Carlo  (MC)  or  importance  sampling  (IS)  simulation. 
Space-time  adaptive  processing  (STAP)  algorithms  are 
of  much  importance  for  radar  detection  [1,2].  They 
are  notoriously  intensive  from  a  computational  point 
of  view,  with  the  more  advanced  (and  robust)  ones 
being  also  analytically  difficult  to  quantify.  Therefore, 
it  is  appropriate  to  attempt  to  develop  fast  simulation 
methods  that  could  be  used  in  their  analysis  and 
design. 

In  the  following  we  use  lessons  learned  from 
developing  IS  techniques  for  characterizing 
conventional  constant  false  alarm  rate  (CFAR) 
detectors  and  describe  an  experiment  in  applying 
them  to  STAP  detection.  The  starting  point  of  this 
effort  is  the  celebrated  adaptive  matched  filter  (AMF) 
detector  derived  in  [2]  and  which  represents  the  array 
version  of  the  workhorse  cell-averaging  (CA)  CFAR 
detector  for  conventional  radar  signal  processing 
algorithms.  The  FAP  performance  of  the  AMF 
detector  is  known  in  integral  form  under  certain 
conditions  and  can  be  numerically  computed  to  any 
desired  accuracy.  Thus  it  forms  a  suitable  basis  for 
validating  our  simulation  experiments.  Two  specific 
IS  methods  (described  in  the  sequel)  are  presented 
and  the  better  (and  also  easier)  one  is  implemented. 

As  a  demonstration  of  the  power  of  IS  methods, 
the  envelope-law  and  geometric-mean  detectors 
are  presented  and  analyzed  using  fast  simulation. 
These  are  variants  of  the  AMF  detector  and  their 
FAPs  are  not  known  in  analytical  form.  Their 
CFAR  property  is  established  and  FAP  behaviour 
characterized  using  IS.  A  brief  comparison  of 
detection  performances  of  all  3  detectors  is  made.  In 
the  long  term,  our  aim  is  to  develop  and  apply  these 
fast  simulation  techniques  to  modern  STAP  detection 
algorithms  (see  [3]— [5],  and  references  therein).  An 
important  goal  in  this  context  is  to  devise  IS  biasing 
methods  that  result  in  simulation  times  that  grow 
slowly  with  decreasing  FAPs.  This  remains  an  open 
problem. 

As  well  known  now,  IS  is  the  chief  simulation 
methodology  for  rare-event  estimation.  It  is  an 
enduring  method  that  has  had  success  in  several  areas 
of  science  and  engineering,  [6].  Briefly,  IS  works  by 
biasing  original  probability  distributions  in  ways  that 
accelerate  the  occurrences  of  rare  events,  conducting 
simulations  with  these  new  distributions,  and  then 
compensating  the  obtained  results  for  the  changes 
made.  The  principal  consequence  is  that  unbiased 
probability  estimates  with  low  variances  are  obtained 
quickly.  The  main  task  in  IS  therefore  is  determination 
of  good  simulation  distributions  for  an  application. 
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Simulations  performed  using  such  distributions 
can  provide  enormous  speed-ups  and,  if  applied 
successfully,  simulation  lengths  needed  to  estimate 
very  low  probabilities  can  become  (only)  weakly 
dependent  on  the  actual  probabilities.  When  these 
probabilities  satisfy  a  large  deviations  principle,  then 
several  asymptotic  results  are  available  for  devising 
simulation  distributions  [6].  For  most  detection 
applications  however,  it  appears  that  adaptive  methods 
which  attempt  to  minimize  error  variances  might  be 
better  suited.  There  have  been  some  recent  attempts 
in  the  literature,  for  example  [7]  and  [8],  to  apply  IS 
for  FAP  estimation  of  CFAR  detectors  with  varying 
degrees  of  success.  Our  work  uses  results  developed 
in  [9]  and  [10],  with  an  adaptive  implementation 
of  the  so  called  g-method.  Recent  theoretical  and 
application  work  on  this  method  can  be  found  in  [11] 
and  [12].  The  groundwork  for  the  present  results  was 
laid  down  in  [15]  which,  to  our  knowledge,  is  the  first 
article  on  the  topic  of  using  IS  for  studying  STAP 
detection  algorithms. 

During  the  conducting  of  simulations  reported 
herein,  some  issues  concerning  the  adaptive  IS 
algorithms  used  arise,  and  these  are  discussed  briefly. 
The  positive  outcome  of  the  methods  used  is  that 
excellent  match  with  numerical  results  is  obtained. 

The  succeeding  sections  provide  a  brief  statement  of 
the  detector  tests,  how  IS  biasing  can  be  performed  to 
hasten  false  alarm  events,  description  of  the  g-method 
which  is  a  conditional  IS  technique  developed 
originally  for  studying  sums  of  random  variables  [9], 
how  inverse  IS  can  be  used  to  estimate  (and  choose) 
detector  thresholds,  a  methodology  for  using  the 
fast  adaptive  algorithms,  simulation  results,  and  a 
conclusion. 

II.  STAP  DETECTOR  VARIANTS 

In  a  radar  system  consisting  of  a  linear  array  of  Ns 
antenna  elements  a  burst  of  N \  pulses  is  transmitted, 
resulting  in  a  return  of  NsNt  =  N  samples  in  each 
range  gate.  The  N  complex  samples  are  arranged 
in  an  N  x  1  column  vector  and  there  are  L  +  1 
such  returns.  The  range  gate  return  to  be  tested  for 
the  presence  of  target  is  called  the  primary  data 
and  is  denoted  by  x  while  the  remaining  vectors 
are  known  as  the  training  (or  secondary)  data, 
assumed  to  be  free  of  target  signal,  and  denoted 
by  x(/),  l  =  1,...,L.  A  target  return  is  modelled  as 
consisting  of  an  N  x  1  steering  vector  s  with  an 
unknown  complex  amplitude  in  addition  to  clutter, 
interference,  and  noise.  The  primary  and  secondary 
data  vectors  are  assumed  to  be  jointly  independent  and 
zero-mean,  spherically  symmetric  complex  Gaussian, 
sharing  the  N  x  N  covariance  matrix  R  =  I^XX*}, 
where  the  superscript  f  denotes  complex  conjugate 
transpose. 


A.  AMF  (Square-Law)  STAP  Detector 

Under  these  assumptions  the  AMF  detection  test, 
as  obtained  in  [2],  is  given  by 


sfR-'x2"; 
stR-is  k 


(1) 


where 

~  1  ^ 

R=iEx(/)xwt 

L  i= i 

is  the  estimated  covariance  matrix  of  x  based  on  the 
secondary  data  (also  referred  to  as  sample  matrix), 
and  r]  is  a  threshold  used  to  set  the  FAP  at  some 
desired  level.  This  test  has  the  CFAR  property.  The 
FAP  a  of  the  test  is  known  to  be  given  by 

L\  rx^-N+\ l-xf-2 

a~  (L-N  +  mN-2)\J0  (1+7 1x/LY'-N"  X 

(2) 

which  can  be  used  to  numerically  determine  the 
threshold  setting  for  a  desired  FAP.  As  shown  in  [2], 
the  test  in  (1)  can  be  rewritten  as 

.  A  ,  ,  H\  ,  A  , 

Is^R- *x|2  ^77StR~1s 

Ho 

=  ^|stR-lx(/)|2.  (3) 

L  1=1 

This  is  in  the  form  of  a  vector  (or  array)  version  of 
the  usual  CA-CFAR  test.  The  left-hand  side  (LHS)  is 
a  square-law  detector,  being  the  output  of  a  matched 
filter  (matched  to  the  direction  s  in  which  the  array  is 
steered)  for  incoherent  detection  using  the  so-called 
sample  matrix  inversion  (SMI)  beamformer  weights 
R~’s.  The  right-hand  side  (RHS)  represents  a  cell¬ 
averaging  term.  Further  details  on  these  issues  can  be 
found  in  the  references  mentioned  above. 


B.  AMF  (Envelope-Law)  STAP  Detector 

In  contrast  to  square-law  detectors,  it  is  known 
to  radar  engineers  that  envelope  detectors  possess 
some  robustness  properties  in  terms  of  detection 
performance  when  the  training  data  is  contaminated 
by  outliers  or  inhomogeneities.  Accordingly,  the 
envelope-law  STAP  version  of  the  AMF  (abbreviated 
hereafter  as  E-AMF)  detector  is  proposed  here  and  its 
detection  properties  evaluated.  It  takes  the  form 

IstR-'xlt^WR-^QI  (4) 

*  L  tt 

where  tjE  denotes  the  threshold  multiplier.  An 
analytical  expression  or  approximation  for  the  FAP 
of  this  detector  is  not  known. 
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C.  Geometric-Mean  STAP  Detector 

Also  proposed  here  is  the  geometric-mean 
STAP  version  of  the  AMF  detector  (abbreviated 
as  GM-STAP).  Conventional  CFAR  detectors  that 
calculate  the  geometric  means  of  the  range  cells  in  the 
CFAR  window  (usually  referred  to  as  log-r  detectors) 
are  known  to  have  robustness  properties.  The  STAP 
variant1  takes  the  form 

h  (  L  \ 1/L 

|stR-1x||%fn|stR-1x(/)|J  (5) 

where  r)G  denotes  the  threshold  multiplier.  Note  that 
the  GM-based  square-  and  envelope-law  versions  are 
identical  (except  for  a  trivial  squaring  of  threshold 
multiplier).  For  the  sake  of  completeness,  the  value 
of  the  multiplier  as  the  number  of  training  vectors 
L  — >  oo  is  calculated.  In  such  a  case,  R_1  -^->R~', 
stR~1x-^->stR-1x,  and  stR_1x(Z)-^->stR_1x(Z)  in 
the  absence  of  target.  As  s+R  ’x  and  stR_1x(Z)  are 
independent  and  identically  distributed  (HD)  and 
distributed  as  CA(1(0,stR_1s),  it  follows  that  the 
squared  envelopes  |stR_1x|2  and  |stR_1x(Z)|2  are 
exponentially  distributed,  each  with  mean  stR^s. 
Using  convergence  arguments  based  on  continuity 
[14],  it  is  straightforward  to  show  that  the  asymptotic 
FAP  of  the  GM  detector  is  exp(— r?gexp(— 7))  where  7 
is  the  Euler  constant.  Hence  the  (asymptotic)  threshold 
r)G  required  to  provide  a  FAP  of  10~6  is  4.9605.  A 
similar  calculation  can  be  easily  made  for  the  E-AMF 
detector. 

Both  the  E-AMF  and  GM-STAP  detectors  have  the 
CFAR  property  under  the  assumption  of  homogeneous 
Gaussian  characterization  as  stated  in  the  beginning 
of  this  section;  their  FAPs  are  independent  of  the 
true  target-free  covariance  R.  This  is  shown  in 
Appendix  A. 


III.  FAP  ESTIMATION  USING  IS 

This  section  describes  procedures  to  quickly 
estimate  the  FAPs  and  threshold  multipliers  of  STAP 
detectors  using  IS.  The  standard  (square-law)  AMF 
detector  is  used  for  exposition.  The  detector  variants 
are  handled  using  parallel  arguments. 

The  actual  form  of  biasing  considered  here  is 
simple  scaling.  While  other  techniques  (superior  in 
terms  of  simulation  gains)  can  be  devised,  scaling 
is  easy  to  implement  and  yields  conservative  but 
reasonable  results. 


'This  test  was  actually  suggested  in  [16]  but  its  performance  was 
not  evaluated. 


A.  Square-Law  AMF  Detection 

Two  strategies  to  estimate  FAPs  using  IS  are 
two-dimensional  (2-D)  biasing  and  the  conditional 
g-method  procedure,  described  in  this  section.  The 
2-D  biasing  scheme  parallels  the  approach  used  in 
previous  works  on  conventional  CFAR  algorithms,  as 
cited  in  the  introduction.  It  can  be  useful  in  situations 
wherein  the  more  powerful  g-method  might  be 
difficult  to  apply. 

1)  2-D  Biasing:  To  estimate  FAP  using  IS, 
we  make  the  following  observations.  Suppose  each 
complex  sample  of  a  secondary  vector  is  scaled 
by  a  real  (and  positive)  number  01/2.  This  has  the 
effect  of  scaling  the  covariance  matrix  estimate  R 
by  9.  Therefore,  as  far  as  the  covariance  estimate 
is  concerned,  both  sides  of  the  test  in  (3)  remain 
unaffected  by  the  scaling.  However,  each  secondary 
vector  being  scaled  by  01/2  results  in  an  additional 
scaling  of  the  RHS  by  6.  Hence  choosing  6  less  than 
unity  will  have  the  effect  of  compressing  the  density 
function  of  the  random  threshold  of  the  test.  Further, 
a  scaling  of  each  complex  component  of  the  primary 
vector  by  a  real  positive  a1/2  will  achieve  a  scaling  of 
the  LHS  of  the  test  by  a.  Thus,  choosing  a  larger  and 
6  smaller  than 'unity  will  achieve  an  increase  in  the 
frequency  of  occurrence  of  false  alarm  events  during 
simulation.  The  IS  optimization  problem  will  be  a 
two-parameter  one. 

Denoting  by  A  the  false  alarm  event  in  (3),  the 
unbiased  IS  estimator  in  an  HD  simulation  can  be 
expressed  as 


1  K 

a  =  -£[l(.A)W(x,xL;a,0)]«,  x,xL 
1  =  1 

(6) 

where  K  is  the  length  (or  number  of  trials)  of  the 
IS  simulation,  1(  )  denotes  the  indicator,  XL  = 

(x(l ),..., x(L))r,  and  the  notation  ~  means  that  all 
random  variables  are  drawn  from  biased  distributions. 
The  weighting  function  W  is  described  below.  In 
setting  up  the  joint  densities  of  X  and  XL,  we  use 
the  fact  that  the  FAP  of  the  AMF  has  the  CFAR 
property  and  is  independent  of  the  true  covariance 
matrix  R.  This  is  so  under  the  assumption  of 
Gaussian  distributions  for  the  data.  In  such  a  case,  the 
simulation  of  the  AMF  test  can  be  carried  out  for  data 
possessing  a  diagonal  covariance  matrix  I,  denoting 
the  N  x  N  identity  matrix.  Therefore,  primary  and 
secondary  data  can  be  generated  as  complex  vectors 
with  independent  components.  The  unbiased  joint 
densities  are 


/(x)  = 


and  f(xL)  = 


~Zln 


so  that 


— 
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With  scaling  performed  as  described  above,  the  biased 
joint  density  takes  the  form 

A(X>XZ.)  =  n(L+l)N  aN  QLN 


resulting  in  the  weighting  function 
W(x,xL;a,0)  =  ^(x,x^ 


/*(x>xl) 

=  CaN9LNeA/aeB/e 


(7) 


where 


A  =  xfx,  B  =  y~]x(/)tx(/)  and  C  =  e(A+B). 

(8) 

The  variance  of  the  IS  estimator  a  can  be  expressed  as 

vara  =  \;[l(y)  -  a2]  (9) 

K. 


with  v  denoting  the  vector  biasing  parameter  ( a,9)T  € 
[l,oo)  x  (0, 1].  The  quantity  /  is  given  by 

I(y)  =  E.{\{A)W\x,xL-,u)} 

=  E{KA)W{x,xl-v)}  (10) 


where  the  expectation  E t  proceeds  over  biased 
distributions.  Minimization  of  vara  with  respect  to 
the  biasing  parameters  is  equivalent  to  minimization  of 
/  and  is  described  in  Appendix  B. 

2)  The  g -Method  Estimator.  This  method  exploits 
knowledge  of  underlying  (input)  distributions  more 
effectively,  yielding  a  more  powerful  estimator. 
Additional  advantages  are  that  only  a  scalar  parameter 
optimization  problem  needs  to  be  tackled  and  the 
inverse  IS  problem  (for  threshold  optimization  or 
selection)  can  be  easily  solved.  The  FAP  can  be 
written  as 


a  =  P(A) 

=  E{P(\s^R~1x\2  >  iptR-'s  |  Xl,H0)} 

=  E{g(XL)}  (11) 


where  g(XL)  denotes  the  conditional  probability  in 
the  second  step.  The  conditioning  implies  that  the 
covariance  matrix  estimate  R  is  given.  We  proceed 
to  estimate  a  using  the  form  in  the  third  step  above. 

With  the  conditioning  in  mind  it  is  easy  to  show, 
assuming  that  x  is  rotationally  invariant  and  Gaussian, 

that  the  random  variable  s^R-1x  =  w^x  is  distributed 
as  CA/jtO.wtRw)  with  independent  real  and  imaginary 
components,  and  the  weight  vector  w  =  R~'s.  Hence 
the  random  variable  Y  =  |s+R._1x|2  is  exponential  and 
has  the  density  function 

„-j/wfRw 

o)  =  -^r’ 


Therefore 

g(Xjr.)  =  P(Y  >  tjs*R~1s  |  Xl,H0) 

=  e-”D 

where 

£)  _  stR-1s 
—  w+  Rw  ‘ 

Note  that  if  R  =  R,  then  g(XL)  =  e~v  and  this  is  the 
FAP  of  the  AMF  when  the  covariance  matrix  is 
known.  As  discussed  before,  we  run  simulations  with 
homogeneous  data  possessing  an  identity  covariance 
matrix,  that  is,  with  R  =  I.  The  g- method  IS  estimator 
then  takes  the  form 


1  K 

at  =  f|>(Xl)W(xl;0)]« 
1  K 


'/*  (12) 


i=l 


with  D  given  by 


D  = 


slR-'s 


w 


slR-^ 

st(R-i)2S‘ 


(13) 


Now,  choosing  the  (single)  biasing  parameter  6  <  1 
produces  a  decrease  in  D,  thereby  causing  a  higher 
frequency  of  occurrence  of  the  false  alarm  event  or, 
more  appropriately  in  this  case,  a  larger  value  of  the 
g-function.  Note  that  use  of  the  g -method  obviates  the 
need  to  bias  primary  data  vectors.  Determination  of  a 
good  value  of  6  proceeds  as  described  in  Appendix  B. 
The  weighting  function  is  simply 

W(xL-,9)  =  eLNe-(l-'/e)B  (14) 


which  can  be  deduced  from  (7)  by  setting  a  =  1 .  The 
variance  of  this  estimator  is  given  by 

varas  =  i[/g(0)-a2]  (15) 

where 


/x(0)  =  £Jg2(XL)W2(XL;0)} 

=  E{g2(XL)W(XL-,9)}.  (16) 


The  minimization  of  (an  estimate  of)  Ig  with  respect 
to  the  biasing  parameter  9  is  carried  out  using  the 
recursion 


em+1=9m-6„J 


m—  1,2,...  (17) 


which  is  just  a  one-dimensional  version  of  (35)  in 
Appendix  B.  The  reader  is  again  referred  to  this 
appendix  for  definitions  of  quantities  in  the  summands 
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Fig.  1.  Example  objective  function  (for  E-AMF  detector). 


of  the  following  estimators  for  Ig  and  its  derivatives 
(with  respect  to  9)  that  are  used  in  (17).  These  are 
respectively  given  by 

1  K 

V*)=tf&2(Xi)W2(Xi:<,)](0 

1  K 

i'g(0) = x  (is) 

A  i=i 

1  K 

i*,W  =  zg[g2(X,)W(x,;0)lVfl(,(x,;0)]« 

where  XL  ~  ft  in  all  3  estimators  above. 

3)  Threshold  Determination:  The  converse 
problem,  namely  that  of  finding  by  fast  simulation  the 
value  of  detector  threshold  rj  satisfying  a  prescribed 
FAP,  is  an  important  task  that  can  be  readily  carried 
out  using  the  estimator  of  (12).  The  genesis  of  the 
method  lies  in  the  so-called  inverse  IS  problem  first 
formulated  and  solved  in  [9].  The  solution  is  to 
minimize  a  “squared  performance  error”  stochastic 
objective  function 


where  is  a  step-size  parameter  and  the  derivative 
estimator  in  the  denominator  is  given  by 


1  ' 

a'g(vm)  =  £[D^DW(xL;0)](i),  ~  L 


(20) 


with  the  prime  indicating  derivative  with  respect  to  r/. 
Note  that  this  derivative  estimator  actually  estimates 
(negative  of)  the  probability  density  function  of  the 
AMF  statistic  on  the  LHS  of  (1)  under  the  hypothesis 
that  target  is  absent.  The  threshold-finding  algorithm 
of  (19)  is  a  key  component  of  the  fast  simulation 
methodology. 

4)  Simulation  Gain:  A  useful  measure  of  the 
effectiveness  of  any  IS  procedure  is  the  simulation 
gain  T .  It  is  the  ratio  of  simulation  lengths  required 
by  conventional  MC  and  IS  estimators  to  achieve  the 
same  error  variance.  The  variance  of  an  MC  estimator 
(tV  =  1)  is  given  by  (a  -  a2)/KMC  for  a  simulation 
length  Kmc.  Hence,  using  (9),  the  IS  gain  for  the 
estimator  of  (6)  is 


P  XMc  a -a2 
K  /(i/)- a2 


(21) 


with  l(u)  defined  in  (10).  Similarly,  using  (15),  the 
g- method  estimator  of  (12)  has  gain  Tg  over  MC 
estimation  given  by 


T 


s 


a  — a2 
Ig(9)~a2 


(22) 


where  Ig(9)  is  given  by  (16).  Note  that  setting  9  =  1  in 
the  above  provides  gain  of  the  g-method  estimator 
without  IS.  The  estimator  always  has  a  smaller 
variance  and  consequently,  Tg  >  T.  Comparing  (21) 
and  (22),  it  is  sufficient  to  show  that  /  <  /  to  prove 
this.  This  is  done  in  Appendix  C. 

As  seen  in  the  sequel,  estimation  of  gain  during 
simulation  plays  an  important  role  in  mechanizing  an 
IS  methodology  that  allows  rare  events  to  be  studied 
quickly.  In  particular,  Tg  is  estimated  as 


f 


s 


(23) 


J(v)  =  lag(v)  ~  ot0f 

where  a0  is  a  desired  FAP.  A  typical  example  is 
shown  in  Fig.  1  for  an  E-AMF  detector.  All  detection 
algorithms  that  involve  a  threshold  crossing  will 
possess  objective  functions  that  have  the  general 
behaviour  shown  for  a0  <  1,  assuming  of  course  that 
the  FAP  estimate  is  (in  a  stochastic  sense)  monotone 
in  the  threshold  r].  Minimization  of  J  with  respect  to  r] 
is  carried  out  by  the  algorithm 


Vm+ 1 


=  Vm+6V 


ao  ~  Otg(.Vw) 

<%(7j 


m  =  1,2,...  (19) 


wherein  the  required  estimates  are  given  in  (12)  and 
(18).  The  denominator  of  the  above  equation  is  related 
to  an  estimate  of  the  IS  variance,  denoted  as  varag. 
Substituting  (12)  and  (18)  into  (23)  and  performing  a 
little  algebra  yields 


Ig(9)-a2g 

varQg  —  "  K 


2F>  ED*-?/ 


i=l  7=1 
¥J 


>  0  (with  probability  1) 


(24) 
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where  yt  =  [g(XL)W(xL;0)]®.  The  variance  estimate  is 
asymptotically  unbiased  and  it  is  easy  to  show,  using 
(15)  and  (24),  that  vam?  -^->vaiag  — >  0  as  K  — +  oo,  for 
any  fixed  9. 

If  the  IS  estimator  of  (6)  is  used  instead  of  (12), 
then  yt  =  [l(-4)W(x,xL;a,0)](i)  and  the  inequality 
in  (24)  will  not  be  strict,  leading  to  a  non-zero 
probability  of  instability  in  gain  estimation.  One 
way  of  trying  to  prevent  this  from  happening  is 
to  overbias  the  simulation,  but  this  can  result  in 
underestimation  of  the  FAP.  Such  instabilities  do  not 
occur  with  the  g -method  estimator  and  this  is  one  of 
its  implementation  advantages. 

B.  E-AMF  and  GM-STAP  Detection 

False  alarm  probability  IS  estimators  for  these 
two  detector  variants  are  set  up  in  completely  parallel 
fashion,  starting  with  the  expression  in  (12).  The  only 
difference  is  in  the  definition  of  the  quantity  D  (given 
in  (13)  for  the  AMF  detector),  while  the  threshold 
multipliers  get  squared.  This  follows  by  observing 
from  (3),  (4),  and  (5)  that  all  3  detectors  studied  here 
differ,  primarily,  on  the  RHSs  of  their  respective  tests. 
Consequently,  denoting  by  DE  and  Da,  respectively, 
the  D  values  for  the  two  variants,  this  leads  to 

_  (d/£-) Ef=  i  |stR-1x(Z)|)2 
E~  st(R-i)2s 

and 

D  _  (nf=iistR-1x(oi2)1/L 

°  “  st(R-i)2s 

The  corresponding  FAP  estimators  are  obtained 
by  using  the  above  definitions  in  (12)  with  g(XL) 
replaced  by  exp(-7?|D£)  and  exp (-t?£Dg), 
respectively.  The  threshold-finding  algorithm  in  (19) 
is  altered  accordingly. 

IV.  MECHANIZING  THE  IS  ALGORITHM 
A.  Methodology 

The  various  issues  of  the  preceding  sections 
are  summarized  in  an  IS  simulation  methodology 
that  outlines  the  principal  steps  required  for 
implementation  of  the  adaptive  algorithms.  It  is 
a  cautious  methodology  which  has  been  used  in 
previous  works  on  applications  of  IS. 

Of  interest  is  the  extreme  but  realistic  (and  often 
encountered)  situation  where  we  have  no  knowledge 
whatsoever  of  the  FAP  a  of  a  detection  algorithm 
for  a  given  value  of  threshold  r 7.  In  such  a  case, 
referring  to  the  objective  function  of  Fig.  1,  the  basic 
idea  of  fast  adaptive  simulation  is  to  travel  down 
the  curve  from  its  left  side.  An  initial  ( a0,r)0 )  pair 


is  needed.  It  is  easily  obtained  using  conventional 
MC  simulation  for  a  high  value  of  a0,  say  0.1  (and  a 
correspondingly  low  value  of  t]0),  using  K  =  100/ao  = 
1000  trials  according  to  the  well-known  MC  thumb 
rule.  This  can  be  accomplished  quickly  with  a  few 
experiments.  The  ( a0,r)0 )  pair  (whose  accuracy  need 
not  be  very  high)  provides  the  starting  values  for  the 
IS  procedure.  It  begins  by  forming  the  estimate  Ig(9) 
at  the  threshold  t]b  (using  1000  trials)  and  locating 
its  minimum  (possibly  graphically)  as  a  function  of 
the  biasing  parameter  9.  It  is  advisable  to  search  for 
the  optimum  bias,  denoted  as  9a,  starting  from  9  =  1 
to  avoid  locating  an  overbiasing  minimizer.  This 
leads,  via  (23),  to  an  estimate  T  of  the  maximum 
IS  gain  available  (with  a  possible  correction  for  a) 
and  results  in  the  quadruplet  (a0,rj0,90,T  (9a)).  At 
this  stage  the  IS  adaptations  are  implemented.  The 
threshold-finding  algorithm  of  (19)  is  implemented 
with  initial  value  r]a  for  a  new  prespecified  a0  slightly 
smaller  than  the  initial  a0  and  using  a  conservative 
number  of  K  -  100 /(a0T(90))  for  the  IS  trials.  It  is 
conservative  because  we  know  that  simulation  gain 
increases  with  decreasing  rare-event  probability  and 
hence  this  number  is  guaranteed  to  provide  better 
than  the  rule-of-thumb  accuracy  at  the  new  a0.  Note 
that  most  of  the  IS  gain  can  be  leveraged  if  the  FAP 
decrements  are  kept  small.  The  biasing  algorithm  of 
(17)  is  implemented  simultaneously  using  9a  as  initial 
value.  At  the  end  of  the  adaptations  the  resulting  gain 
is  estimated  for  updating  K  and  a  new  quadruplet  is 
obtained.  This  is  continued  until  we  have  a  complete 
characterization  of  false-alarm  behaviour  of  the 
detector  down  to  the  desired  a0.  The  procedure  is 
summarized  below. 

1)  Implementation :  Define  the  set  such 

that  0.1  =  >  o£2)  >  ■  •  •  >  aP  =  10~7,  for  example. 

Preprocessing: 

Step  1  p  =  1.  Use  1000  conventional  MC  trials  to 
obtain  (a^,^1*)  pair. 

Step  2  Find  9P  =  argmin glg(9)  and  calculate 
f  «(fl0»>)  using  (23). 

Adaptive  IS: 

Step 3  p  =  p+  1. 

Step  4  Let  m  =  1.  Set  9 j>>  =  9^~l\  rtf  =  r)^ 
and  K  =  100 /(o^f  ^(fl^-1*)). 

Step  5  Generate  K  secondary  vectors  and 
compute  B  and  D  using  (8)  and  (13). 

Step  6  Implement  (12),  (18)  using  biased 
secondary  vectors  with  parameter  0%\ 

Step  7  Compute  9(p+l  and  Vmh  using  (17)  and 
(19);  if  both  algorithms  have  converged,  set  9fp  = 

C+v  =  tflv  f 1 w0«)  =  f  «(0  using  (23), 

and  go  to  step  3  or  stop  if  p  =  P;  otherwise  go  to  step 
5  with  m  =  m  +  1. 

At  the  end  of  simulations  we  have  the  P  quadruplets 

{oS’).ijjr),^),fw(flW)}f. 
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B.  Complexity  Reduction  for  IS  Adaptations 

The  methodology  is  simple  enough  but  certain 
observations  can  be  made  which  lead  to  an  artifice 
that  further  reduces  computational  effort  to  a 
considerable  extent.  In  rare-event  simulation  of 
highly  reliable  systems  that  involve  complex  signal 
processing  operations,  two  factors  contribute  to 
simulation  time.  The  first  is  the  rare  event  itself  that 
is  under  study;  this  is  handled  by  suitable  IS  biasing 
techniques.  The  second  factor  is  the  computational 
intensity  of  the  signal  processing  of  the  system. 

In  STAP  detectors  the  chief  processing  burden  is 
from  inversion  of  large  matrices.  Several  millions 
of  MC  trials  with  as  many  matrix  inversions  are 
needed  to  estimate  low  FAPs.  Using  an  IS  scheme 
the  number  of  trials  can  be  reduced  appreciably,  for 
a  single  rare-event  probability  estimation.  However, 
there  remains  the  important  matter  of  executing  the 
IS  adaptations  of  (17)  and  (19),  using  this  small 
number  of  IS  trials  for  each  iteration.  These  can  be 
computationally  burdensome  if  the  recursions  are  to 
be  carried  out  for  several  a0  values.  This  is  where 
the  artifice  comes  in.  It  turns  out,  in  contrast  with 
conventional  MC  simulation,  that  adaptations  can  be 
performed  by  reusing  the  unbiased  random  quantities 
generated  for  any  one  set  of  trials.  Consequently  it 
is  not  necessary  to  run  truly  randomized  adaptive 
IS  algorithms.  The  result  will  be  a  large  savings  in 
computational  effort. 

This  idea  is  illustrated  by  rewriting  the  FAP 
estimator  of  (12)  in  the  form 

ag  =  ^^[e-^^e_(fl-1)Su](0,  (25) 

A  i=i 

obtained  by  substituting  (14)  into  (12),  replacing 
D  and  B,  respectively,  by  their  unbiased  versions 
denoted  as  Dv  and  Bv,  and  observing  that  whereas 
the  estimator  in  (12)  uses  samples  drawn  from 
biased  distributions  the  mathematically  equivalent 
estimator  of  (25)  operates  with  only  unbiased 
quantities,  obtained  via  /.  Biasing  is  handled  by 
the  (explicit)  presence  of  9.  The  number  of  trials 
remains  unchanged,  at  K.  The  estimators  required  in 

(17)  and  (19)  can  be  rewritten  in  a  similar  manner. 

It  is  a  straightforward  exercise  and  is  omitted. 
Assuming  then  that  the  value  of  K  is  large  enough 
for  estimation  of  the  target  a0  (and  this  is  guaranteed 
by  the  methodology  described  above),  it  follows  that 
the  set  of  K  instances  of  Dv  and  Bv  employed  in  (25) 
can  be  repeatedly  used  in  the  adaptations.  With  the 
complexity  of  frequent  regeneration  thus  removed, 
these  adaptations  (such  as  minimization  of  Ig(9)  in 

(18) )  can  be  extremely  fast,  often  requiring  only  a  few 
iterations.  It  is  assumed  that  a  sufficiently  large  pool 
of  unbiased  variables  is  pregenerated  to  accommodate 
K  trials. 


There  are  two  related  issues  that  are  briefly 
discussed  here  in  qualitative  terms.  In  IS  simulations 
carried  out  here  (and  elsewhere),  it  is  tacitly  assumed 
that  the  number  of  trials  K  is  also  large  enough  to 
accurately  estimate  IJ9).  This  quantity  is  admittedly 

A 

not  a  rare-event  probability  and  its  estimator  Ig(9), 
given  in  (18),  is  clearly  unbiased.  Nevertheless, 
its  accuracy  should  be  studied  in  terms  of  its  error 
variance,  irrespective  of  whether  we  employ  the 
above  described  reuse  strategy  or  not.  In  this  regard 
it  must  be  pointed  out  that  the  biasing  densities  used 
for  IS  simulations  are  being  chosen  to  minimize 
Ig(9)  (or  its  estimate  Ig(9)),  and  not  to  estimate 
it  accurately.  Indeed,  attempting  to  estimate  I  (9) 
accurately  would  lead  us  to  consider  further  biasing 
of  the  original  biasing  distribution.  As  probably 
known  to  IS  researchers,  the  more  powerful  a  biasing 
distribution  is  (in  the  sense  of  being  “close”  to  the 
unrealizable  optimal  biasing  density  that  estimates  the 
rare-event  probability  exactly),  the  more  “constant” 
will  be  the  HD  random  variables  yt  defined  after 
(24).  Interestingly  this  implies  that  even  for  small 
K,  the  estimated  variance  vara  can  become  small 
for  such  biasing  distributions.  The  latter  observations 
accentuate  the  need  to  search  for  good  simulation 
distributions.  Unfortunately,  it  is  beyond  the  remit 
of  this  papier  to  investigate  the  variance  and  rate  of 
convergence  of  Ig(9)  and  the  dependencies  of  these 
on  biasing  distributions  and  number  of  trials  K.  Some 
remarks  on  this  issue  can  be  found  in  [16].  We  have 
confirmed  the  legitimacy  of  the  reuse  strategy  in 
several  cases  in  two  ways.  Using  an  overkill,  that  is, 
employing  a  large  number  of  IS  trials  for  varying  9, 
it  has  been  ascertained  that  the  correct  value  of  the 
optimum  bias  does  not  differ  noticeably  from  the 
90  determined  as  above.  Secondly,  truly  randomized 
optimization  algorithms  have  been  used,  with  the  same 
conclusion. 

The  second  issue  concerns  the  ease  with  which  the 
form  of  the  estimator  in  (25)  is  obtained.  The  reason 
lies  in  the  biasing  technique  used.  Biasing  in  IS  can 
be  classed,  for  the  purpose  of  this  discussion,  into  two 
types.  One  kind  derives  biased  quantities  by  direct 
parameterized  transformations  of  unbiased  variables, 
such  as  the  scaling  employed  here.  The  second  is  just 
the  complementary  class,  where  such  transformations 
do  not  exist.  Examples  of  the  latter  can  include  new 
distributions  for  conducting  simulations  and  some 
instances  of  distributions  based  on  large  deviations 
theory  arguments.  Within  the  first  class,  only  the 
subset  of  transformations  that  are  many-to-one  will 
allow  placing  the  IS  estimator  in  a  form  such  as  that 
of  (25). 

V.  SIMULATION  RESULTS 

Implementation  results  of  the  fast  simulation 
procedures  using  the  g-method  are  described  here. 
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AMF:  L  =  704,  N  =  352 
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recursions 

Fig.  2.  Threshold  optimization  for  AMF  detector 
( L  —  704 ,N  -  352)  using  inverse  IS. 

A  typical  example  of  evolution  of  the 
threshold-finding  algorithm  is  shown  in  Fig.  2 
for  the  square-law  AMF  detector  with  L  =  704 
training  vectors  and  space-time  product  N  =  352. 
Clearly,  about  20  recursions  of  (19)  appear  sufficient 
for  convergence.  Note  also  from  this  figure  that 
the  “converged”  value  of  the  threshold  for  an  ag 
acts  as  the  initial  value  for  the  next  lower  a0,  as 
described  in  Section  IVA.  Although  not  shown  here, 
similar  estimation  results  have  been  obtained  for  the 
E-AMF  and  GM-STAP  detectors.  The  results  for 
the  AMF  detector  almost  “coincide”  with  numerical 
computations  of  (2).  As  analytical  expressions  or 
approximations  for  the  FAPs  are  not  available,  such  a 
comparison  is  not  possible  for  the  variants.  Threshold 
multiplier  values  for  the  3  detectors  are  summarized 
in  Fig.  3  for  L  =  128  and  N  =  64.  For  the  E-AMF  and 
GM-STAP  detectors  the  interpolations 

tje  =  0.0095*3  -  0.1713*2  +  1.8869*  +  1.6873 

rjc  =  0.01 1  lx3  -  0.1985*2  +  2.2351*  +  1.9899 

for  L  =  128  and  N  =  64,  and 

r]E  =  0.01*3  -0.1865*2  +  1.8822*  +  1.7255 

T]C  =  0.01 18*3  -  0.2198*2  +  2.2292*  +  2.0408 

for  L  =  704  and  N  =  352,  obtained  from  inverse  IS 
results  where  *  =  -log10a,  can  be  used  for  estimating 
multipliers  for  FAPs  in  the  range  10_1  to  10-7. 

Performance  graphs  of  the  IS  algorithms  are  in 
Figs.  4  and  5,  which  show  the  estimated  IS  gain 
T  obtained  over  MC  simulation  and  the  number 
of  trials  required,  respectively.  The  number  of  IS 
trials,  although  appreciably  reduced  in  comparison 
with  MC,  increases  rapidly  with  decrease  in  FAP. 

From  these  two  figures  it  can  be  observed  that,  for 
a  FAP  of  10~6,  the  square-law  AMF  with  L  =  128 
and  N  =  64  requires  1500  IS  trials  whereas  only  100 
trials  are  needed  for  L  =  704  and  N  =  352.  This  can 
be  attributed  to  the  fact  that  D  and  B,  the  only  random 


Fig.  3.  Multipliers  for  L  =  128,  and  N  =  64.  Starred  dots  indicate 
validations  for  E-AMF  detector  using  conventional  MC 
simulations. 


Fig.  4.  IS  gains.  Set  of  3  lower  graphs  correspond  to  L-  128, 
N  =  64. 
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SNR  dB 


Fig.  6.  Detection  performances  in  homogeneous  clutter.  L  =  128, 
N  =  64,  ao  =  10-6. 


SNR  (dB) 


Fig.  7.  Detection  with  2  interfering  targets  in  training  data. 
L  =  128,  N  =  64,  oca  =  lO"6. 


quantities  that  appear  in  (12),  have  more  concentrated 
density  functions  in  the  latter  case,  thus  causing  the  IS 
biasing  to  be  more  effective  during  simulation. 

Detection  probabilities  for  all  three  detectors  have 
been  estimated  and  compared  at  a  FAP  of  10-6.  For 
homogeneous  clutter  backgrounds  and  nonfluctuating 
(Swerling  0)  and  fluctuating  (Swerling  1)  targets, 
the  results  are  in  Fig.  6.  The  SNR  loss  of  the  two 
variants  compared  with  the  AMF  detector  is  very 
slight,  the  maximum  being  0.3  dB  for  a  Swerling  0 
target.  Performance  degradations  for  training  data 
contaminated  by  nonhomogeneities  consisting  of 
interfering  targets  that  are  assumed  to  have  the  same 
Doppler-angle  properties  and  characteristics  as  the 
primary  target  are  shown  in  Fig.  7  for  two  Swerling 
0  and  Swerling  1  targets.  Each  interferer  is  assumed 
to  have  the  same  steering  vector  and  power  as  the 
primary  target.  Similar  results  (not  shown  here)  have 
been  obtained  for  different  numbers  of  interferers.  It 
is  evident  that  the  GM-STAP  detector  is  most  robust 
in  the  presence  of  interferers,  enjoying  (in  some 
cases)  several  decibels  of  power  advantage  over  the 
square-law  AMF  detector.  Consequently,  its  FAP 
performance  in  these  situations  (though  not  evaluated 
here)  should  also  be  relatively  robust. 

A.  Comments 

Despite  the  accuracy  of  our  results,  biasing  by 
scaling  has  not  done  as  well  for  the  variants  as  for 
the  AMF  detector.  Furthermore,  even  for  the  latter,  IS 
performance  should  be  improved  by  a  better  choice  of 
biasing  scheme  in  order  to  achieve  the  goal  of  having 
the  required  number  of  trials  to  be  constant,  or  at 
least  increasing  very  slowly,  with  decrease  of  FAP. 
This  is  certainly  a  matter  for  further  investigation  that 
may  pay  dividends  while  examining  other  detector 
configurations.  Scaling  was  used  in  this  work  in  the 
interest  of  expediency.  Some  details  regarding  the 


behaviour  of  the  scaling  parameter  9  are  available 
in  [15].  Briefly,  it  is  close  to  unity  and  has  a  small 
spread  over  the  range  of  FAPs  considered.  This  is  due 
to  the  shape  of  the  density  functions  of  B  and  D. 

VI.  CONCLUSION 

A  humble  inroad  into  the  use  of  adaptive  IS 
algorithms  to  characterize  a  STAP  detector  has  been 
made.  The  AMF  detector  was  used  for  validation 
and  results  have  been  good.  The  chief  reasons  for 
this  are  that  we  were  able  to  invoke  the  g -method 
and  inverse  IS,  use  a  biasing  strategy  that  could  be 
easily  optimized  adaptively,  and  find  a  way  around 
the  difficult  task  of  inverting  large  matrices  several 
times  during  simulations.  As  a  small  demonstration 
of  the  potential  of  IS,  two  AMF  detector  variants  that 
are  known  to  be  relatively  obdurate  to  mathematical 
analysis  have  been  suggested  and  characterized. 
Sufficient  numerical  results  have  been  provided 
here  in  order  to  motivate  interested  researchers  to 
confirm  our  findings  and  possibly  carry  out  their 
own  investigations  into  the  subject.  There  are  some 
technical  issues  regarding  IS  estimators  that  remain  to 
be  settled;  however,  the  development  of  better  biasing 
methods  is  an  important  matter.  In  any  case,  our  hope 
is  that  application  of  these  fast  simulation  techniques 
to  more  advanced  STAP  configurations  will  also  meet 
with  success.  But  this  remains  to  be  seen  as  we  are 
certainly  not  in  position  to  predict  what  subtleties  (and 
difficulties)  these  other  detection  algorithms  can  throw 
up.  It  is  clear  that  IS  is  still  in  its  infancy,  especially 
insofar  as  its  use  for  characterizing  modern  detection 
algorithms  is  concerned. 

APPENDIX  A.  CFAR  PROPERTY 

An  invariance  property  is  established  that  can  be 
used  to  show  that  certain  STAP  detection  algorithms 
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have  FAPs  that  do  not  depend  on  the  data  covariance 
R.  The  proposition  given  here  follows  quite  simply 
from  arguments  contained  in  the  exposition  of  the 
generalized  likelihood  ratio  STAP  detector  test 
(Kelly’s  GLRT)  given  in  [1],  They  are  outlined  here 
for  convenience  of  the  reader.2  Assume,  as  before,  that 
the  primary  and  training  data  vectors  have  the  same 
covariance.  Consider  the  variables 

G  =  sfR_1x  and  G(Z)  =  s+R-'x(Z)  (26) 

for  Z  =  1  Using  the  transformations  u  =  R_1/2s, 
y  =  R_1/2x,  and  y (Z)  =  R~'/2x(Z),  leads  to 

G  =  utR-'y  and  G(Z)  =  ufR  ~ly(l)  (27) 

where  R  =  R-1/2RR-1/2.  The  whitened  vectors  Y 
and  Y(Z)  are  both  distributed  CJfN(0,I).  It  turns 
out  that  R  has  the  complex  Wishart  distribution 
CW(L,N \(l / L)T),  [13].  Further,  a  unitary 
transformation  U  can  be  found  which  rotates  the  new 
signal  vector  u  into  an  elementary  vector  e  as  de  = 
UV  such  that  e  =  [l,0,...,0]t  and  where  d2  =  u^u  = 
stR*'s.  The  first  column  of  U  is  the  new  signal  vector 
u/d.  The  remaining  columns  comprise  an  orthonormal 
basis  determined,  for  example,  by  a  Gram-Schmidt 
procedure.  Let  z  =  Ufy  and  z(Z)  =  Ufy(Z).  Applying 
these  to  (27)  yields  the  variables 

G  =  ^et5“1z  and  G(Z)  =  ^e+5_1z(Z) 

(28) 

where  S  =  LU^RU.  While  Z  and  Z (Z)  are  distributed 
as  CNn(0,I)  and  are  independent,  S  has  the 
distribution  CV\>(L,N\Y).  The  vectors  z  and  z (Z)  are 
decomposed  as  z  =  [zA  zBf  and  z(Z)  =  [zA(l)  z B(Z)]r 
where  the  A  components  are  scalar  and  B  components 
(N  —  l)-vector.  Correspondingly,  S  is  decomposed  as 


L 

S  =  £z(Z)z(Z)+  = 
/=! 


'Saa 

Sba 


(29) 


so  that  Sm  ~  S/= i  za(^)zb(0^>  Sbb  —  22/Li  zb(0zb(0^ 
and  so  on.  Also 


where 


AA 


AB  : 


3 

e* 

II 

1 

<0 

(30) 

1  —  (*^AA  ^AB^BB^BA)  * 

=  _T>  C  C-l 

(31) 

AA^AB'-’BB 


which  follow  from  the  Frobenius  relations  for 
inversion  of  block  matrices,  [1],  Substituting  (30)  and 


2It  would  be  helpful  for  the  reader  to  refer  to  [1]  (see  also  [2]). 

We  have  used  several  results  from  this  now  classic  paper  and  have 
attempted  to  maintain  the  same  notation. 


(31)  in  (28)  gives 

G  =  j^aaY  and  G(Z)  =  ^y(Z)  (32) 

where 

L 

y  =  l A  ~  XX(0zB(0f<SBBZB 

/=1  L  (33) 

y(l)  =  zA  (Z)  -  ^ZA{i)zB(i)^SBlzB{l). 

1=1 

Conditioned  on  the  vectors  zB  and  zB(Z),  it  follows 
that  the  random  variables  Y  and  Y(l)  in  (33)  are  (in 
the  absence  of  target)  zero  mean  Gaussian.  With  a 
little  more  algebra  it  can  be  shown,  as  in  [1],  that  they 
are  uncorrelated  with  variances 

£B{|T|2}  =  l+z^gzB(Z)zB(Z)tj  zB 

and 


EBmn\2}  =  i  -  zB(of  ^zB(i)zfl(o^  zB(z) 

for  Z  =  1,...,L, 'with  EB  denoting  conditional 
expectation.  Further,  the  conditional  covariance  of  the 
variables  Y(l)  is  given  by 

EB{Y(k)Y(nY}  =  -zB(n?  ^gzB(Z)zB(Z)^  zB(k) 
for  k^n. 

Hence  the  set  of  conditionally  jointly  Gaussian 
zero  mean  random  variables  Y  and  (K(Z)}[  have 
individual  variances  and  covariances  that  are  functions 
of  the  random  vectors  zB  and  {zB(Z)}j.  The  latter 
are  all  jointly  independent,  each  being  distributed  as 
CA/^_](0,I).  The  probability  of  any  event  defined  on 
the  random  variables  Y  and  (T(Z)}^  in  (33)  can  thus 
be  determined  by  pierforming  an  averaging  operation 
over  the  distributions  of  zB  and  {zB(Z)}f  and  this 
probability  will  be  independent  of  the  data  covariance 
R.  This  statement  is  also  true  for  the  random  variables 
G  and  (G(Z)}f  in  (32)  with  the  caveat  that  any 
constant  scaling  of  these  variables  should  leave  the 
event  unchanged.  The  preceding  arguments  therefore 
constitute  proof  of  the  following. 

PROPOSITION  Any  STAP  detection  algorithm  that  uses 
only  the  random  variables  G  and  {G(Z)}f  defined  in 
(26)  for  its  description  such  that  the  algorithm  itself 
is  unchanged  by  arbitrary  but  equal  scaling  of  all 
these  variables,  has  a  FAP  which  is  independent  of  the 
target-free  data  covariance  R 

It  follows  immediately  from  this  that  both  the 
E-AMF  and  GM-STAP  detectors  have  FAPs  that  are 
independent  of  the  covariance  matrix  R. 


282 


IEEE  TRANSACTIONS  ON  AEROSPACE  AND  ELECTRONIC  SYSTEMS  VOL.  43,  NO.  1  JANUARY  2007 


APPENDIX  B.  ADAPTIVE  ALGORITHMS  FOR  2-D 
BIASING 

From  the  first  line  of  (10),  the  /-function  is 
estimated  as 

1  K 

i(u)  =  -J2[HA)w2(x,xL;V)Yt>,  -/*  (34) 

1=1 

and  its  minimization  can  be  carried  out  using  the  2-D 
adaptive  algorithm 

"m+l  =l,m-^Jm1V/(^m),  m  =  1,2,...  (35) 

Here,  S  is  a  step-size  parameter  used  to  control 
convergence,  and  m  is  the  index  of  iteration.  This  is  a 
stochastic  Newton  recursion.  It  achieves  minimization 
of  /  by  estimating  a  solution  of 

V/(i/)  =  (/a  Ig)T  =  0 

where  Ia  =dl(y)/da  and  I6  =dl(v)/d6.  The  estimate 
of  the  Jacobian  J  (which  is  the  Hessian  matrix  of  /)  is 


where  /  =  dlx/dy.  Starting  with  the  second  line  of 
(10),  the  various  /-functions  defined  above  can  be 
obtained  by  using  the  notational  equations 

Ix  =  E{l(A)dW/dx} 

=  £*{1M)WJ 

and  similarly 

Ixx  =  EAKA)WWxx} 

Ixy=E.{HA)WWxy} 

with  the  corresponding  weighting  function  derivatives 
easily  calculated,  using  (7),  as  Wx  =  dW /dx,  Wxy  = 
d2W jdxdy,  and  so  on.  These  /-functions  can  be 
estimated  as  in  (34). 

APPENDIX  C.  GAIN  OF  THE  g-METHOD 
ESTIMATOR 

The  proof  given  here  is  in  the  context  of  STAP 
detectors  and  is  a  generalization  of  the  one  in  [10] 
for  conventional  CFAR  detection  algorithms.  A  more 
complete  generalization  for  estimators  in  an  arbitrary 
rare-event  setting  is  available  in  [11]. 

When  no  IS  is  used  (W  =  1),  then  Ig  = 

£{g2(XL)}  <  E{g(Xt)}  =  a.  To  show  that  Ig  <  I  = 
I(y)  (the  biasing  vector  is  omitted  for  convenience) 
with  IS,  some  additional  notation  is  useful.  Let  U  = 
|stR_1X|2  and  let  V  denote  the  RHS  of  any  of  the 
three  STAP  detectors,  excluding  the  multiplier.  We 
bear  in  mind  that  U  is  a  function  of  X  and  XL,  and 


V  a  function  of  XL.  Then,  in  the  absence  of  target, 
a  =  P(U  >  rjV )  and  (10)  can  be  written  as 

/  =  £{1(//>7?V)W(x,xl)} 

with  W  defined  in  the  first  line  of  (7).  For  the 
g-method  estimator,  we  can  write 

g(Xj,  )  =  P(U  >T]V\Xl  =  Xjr  ) 

=  E{\(U  >  rjV)  |  XL  =  xL} 

=  J  1(m  >  »?v)/(x  |  xL)dx 

=  J  l(u  >  vv)Wc(x,xL)ft(x  |  xL)dx 
where  Wc(x,xL)  =  f(x  \  xL)/fJx  \  xL).  Then 

g2(xL)<  J  1(«  >  vv)Wc2(x,xL)ft(x  |  xL)dx 

=  J \(u>r]v)Wc(x,xL)f(x\xL)dx 

by  Jensen’s  inequality.  Therefore  Ig,  defined  in  (16),  is 
given  by 

Ig  =  J  g2(xL)W  (xL)f(xL)dxL 

<  j  j  l(u>r)v)Wc(x,xL)W(xL)f(x\xL)f(xL)dxdxL 

=  J  J  l(u>r)v)W(x,xL)f(x,xL)dxdxL 
=  /. 
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